PNAS Nexus
Top medRxiv preprints most likely to be published in this journal, ranked by match strength.
Show abstract
The COVID-19 pandemic has highlighted the critical role of genomic surveillance for guiding policy and control strategies. Timeliness is key, but rapid deployment of existing surveillance is difficult because most approaches are based on sequence alignment and phylogeny. Millions of SARS-CoV-2 genomes have been assembled, the largest collection of sequence data in history. Phylogenetic methods are ill equipped to handle this sheer scale. We introduce a pan-genomic measure that examines the infor...
Show abstract
As we begin to recover from the COVID-19 pandemic, a key question is if we can avert such disasters in future. Current surveillance protocols generally focus on qualitative impact assessments of viral diversity 1. These efforts are primarliy aimed at ecosystem and human impact monitoring, and do not help to precisely quantify emergence. Currently, the similarity of biological strains is measured by the edit distance or the number of mutations that separate their genomic sequences 2-6, e.g. the n...
Show abstract
The rapid growth of biological datasets presents an opportunity to leverage past studies to inform and predict outcomes in new experiments. A central challenge is to distinguish which serological patterns are universally conserved and which are specific to individual datasets. In the context of human serology studies, where antibody-virus interactions assess the strength and breadth of the antibody response and inform vaccine design, differences in cohort demographics or experimental design can ...
Show abstract
The dominance of the COVID-19 Delta variant has renewed questions about the impact of K12 school policies, including the role of masks, on disease burden.1 A recent study showed masks and testing could reduce infections in students, but failed to address the impact on the community,2 while another showed masking is critical to slow disease spread in communities, but did not consider school openings under Delta.3 We project the impact of school-masking on the community, which can inform policy de...
Show abstract
The continuing emergence of SARS-CoV-2 variants of concern (VOCs) presents a serious public health threat, exacerbating the effects of the COVID19 pandemic. Although millions of genomes have been deposited in public archives since the start of the pandemic, predicting SARS-CoV-2 clinical characteristics from the genome sequence remains challenging. In this study, we used a collection of over 29,000 high quality SARS-CoV-2 genomes to build machine learning models for predicting clinical detection...
Show abstract
The emergence of novel SARS-CoV-2 variants in late 2020 and early 2021 raised alarm worldwide and prompted reassessment of the management, surveillance, and projected future of COVID-19. Mutations that confer competitive advantages by increasing transmissibility or immune evasion have been associated with the localized dominance of single variants. Thus, elucidating the evolutionary and epidemiological dynamics among novel variants is essential for understanding the trajectory of the COVID-19 pa...
Show abstract
Clinical trials and epidemiological cohort studies often group similar diseases together into a composite endpoint, to increase statistical power. A common example is to use a 3-digit code from the International Classification of Diseases (ICD), to represent a collection of several 4-digit coded diseases. More recently, data-driven studies are using associations with risk factors to cluster diseases, leading this article to reconsider the assumptions needed to study a composite endpoint of sever...
Show abstract
The events of the recent SARS-CoV-02 epidemics have shown the importance of social factors, especially given the large number of asymptomatic cases that effectively spread the virus, which can cause a medical emergency to very susceptible individuals. Besides, the SARS-CoV-02 virus survives for several hours on different surfaces, where a new host can contract it with a delay. These passive modes of infection transmission remain an unexplored area for traditional mean-field epidemic models. Here...
Show abstract
The emergence of variants has shaped the COVID-19 pandemic. The lack of directly observed precursors to these variants has led to proposals that variants emerge from either persistent infections, transmission in non-human animal populations after reverse-zoonosis, or cryptic transmission in the human population. We investigated the origin of variants by analyzing the molecular clock and rate of nonsynonymous and synonymous substitutions in SARS-CoV-2 circulating in human population, persistently...
Show abstract
Frailty indices (FI) based on continuous valued health data, such as obtained from blood and urine tests, have been shown to be predictive of adverse health outcomes. However, creating FI from such biomarker data requires a binarization treatment that is difficult to standardize across studies. In this work, we explore a "quantile" methodology for the generic treatment of biomarker data that allows us to construct an FI without preexisting medical knowledge (i.e. risk thresholds) of the included...
Show abstract
Real world evidence studies of mass vaccination across health systems have reaffirmed the safety1 and efficacy2,3 of the FDA-authorized mRNA vaccines for COVID-19. However, the impact of vaccination on community transmission remains to be characterized. Here, we compare the cumulative county-level vaccination rates with the corresponding COVID-19 incidence rates among 87 million individuals from 580 counties in the United States, including 12 million individuals who have received at least one va...
Show abstract
The data-driven targeting of COVID-19 vaccination programs is a major determinant of the ongoing toll of COVID-19. Targeting of access to, outreach about and incentives for vaccination can reduce total deaths by 20-50 percent relative to a first-come-first-served allocation. This piece performs a systematic review of the modeling literature on the relative benefits of targeting different groups for vaccination and evaluates the broader scholarly evidence - including analyses of real-world challe...
Show abstract
Transmission reconstruction--the inference of who infects whom in disease outbreaks--offers critical insights into how pathogens spread and provides opportunities for targeted control measures. We developed JUNIPER (Joint Underlying Network Inference for Phylogenetic and Epidemiological Reconstructions), a highly-scalable pathogen outbreak reconstruction tool that incorporates intrahost variation, incomplete sampling, and algorithmic parallelization. Central to JUNIPER is a statistical model for...
Show abstract
New and more transmissible variants of SARS-CoV-2 have arisen multiple times over the course of the pandemic. Rapidly identifying mutations that affect transmission could improve our understanding of viral biology and highlight new variants that warrant further study. Here we develop a generic, analytical epidemiological model to infer the transmission effects of mutations from genomic surveillance data. Applying our model to SARS-CoV-2 data across many regions, we find multiple mutations that s...
Show abstract
Despite the growing recognition that social determinants of health (SDoH) play a prominent role in shaping health outcomes, inconsistent measures across health systems and research studies - and the absence of best practices for harmonizing, transforming, or combining variables - limits our ability to incorporate them into disease models. To address this gap, we applied the Healthy People 2030 (HP2030) framework to define, quantify, and incorporate SDoH into composite scores for disease predicti...
Show abstract
The COVID-19 pandemic highlighted the critical need for robust methods to monitor viral evolution and detect emerging variants of concern (VOCs). Traditional genomic surveillance often lacks predictive power. This study expanded an unsupervised machine learning clustering algorithm, based on SARS-CoV-2 Spike protein Levenshtein distance, to track and predict variant predominance across six European countries from 2020 to January 2024. We also investigated the influence of genetic distances and c...
Show abstract
The birth and death of a pandemic can be region specific. Pandemic seems to make repeated appearance in some places which is often attributed to human neglect and seasonal change. However, difference could arise from different distributions of inherent susceptibility ({sigma}inh) and external infectivity ({iota}ext) from one population to another. These are often ignored in the theoretical treatments of an infectious disease progression. While the former is determined by the immunity of an indiv...
Show abstract
As demonstrated by the SARS-CoV-2 pandemic, the emergence of novel viral strains with increased transmission rates poses a significant threat to global health. Viral genome sequences, combined with statistical models of sequence evolution, may provide a critical tool for early detection of these strains. Using a novel statistical model that links transmission rates to the entire viral genome sequence, we study the power of phylogenetic methods--using a phylogenetic tree relating viral samples--a...
Show abstract
Herd immunity refers to the collective resistance of a population against the spreading of an infection as an epidemic. Understanding the dependencies of herd immunity on various epidemiological parameters is of immense importance for strategizing control measures against an infection in a population. Using an age-dependent branching process model of infection propagation, we obtain interesting functional dependencies of herd immunity on the incubation period of the contagion, contact rate, and ...
Show abstract
Influenza is a highly transmissible respiratory virus that can cause devastating pandemics. The last influenza pandemic, caused by a strain of A(H1N1) virus, resulted in millions of cases and hundreds of thousands of deaths worldwide. After the initial outbreaks, many regions experienced subsequent waves driven by environmental conditions, waning immunity, and other factors. More dramatically, several locations saw large resurgent outbreaks, yet the mechanisms behind these severe resurgences rem...